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SEQUENCE LISTING 



HhUEIvED 

AUG 0 9 2001 
TECH CENTER 1600/2900 





1) GENERAL INFORMATION 



oN (i) APPLICANTS 
^ (A) NAME: Peter Stougaard 



Ole Cai Hansen 

'^l&m^^ TITLE 0F INVENTION: RECOMBINANT HEXOSE OXIDASE, A 

METHOD OF PRODUCING SAME AND USE OF SUCH ENZYME 



(iii) NUMBER OF SEQUENCES: 34 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Hunton & Williams 

(B) STREET: 1900 K Street, N.W. 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: U.S.A. 

(F) ZIP: 20006-1109 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/669,304 

(B) FILING DATE: 12 July 1996 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: - 

(A) APPLICATION NUMBER: 08/476,910 

(B) FILING DATE: 7 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Stanislaus Aksman 

(B) REGISTRATION NUMBER: 28,562 

(C) REFERENCE/DOCKET NO.: 54320.000003 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 955-1926 

(B) TELEFAX: (202) 778-2201 

(C) TELEX: None 



(|) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
i (A) LENGTH: 8 amino acids 

! (B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
Tyr Glu Pro Tyr Gly Gly Val Pro 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Ala lie lie Asn Val Thr Gly Leu Val Glu Ser Gly Tyr Asp Xaa Xaa 
1 . 5 10 15 

Xaa Gly Tyr Xaa Val Ser Ser 
20 

2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Asp Leu Pro Met Ser Pro Arg Gly Val lie Ala Ser Asn Leu Xaa Phe 
15 10 15 
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(?) INFORMATION FOR SEQ ID NO: 4: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 
j (D) TOPOLOGY: unknown 

j (ii) MOLECULE TYPE: peptide 

i (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

I 

Asp Ser Glu Gly Asn Asp Gly Glu Leu Phe Xaa Ala His Thr 
,1 .5 10 

12) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Tyr Tyr Phe Lys 
1 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids- 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Pro Gly Tyr lie Val lie Asp Val Asn Ala Gly Thr Xaa Asp 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 




(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Leu Gin Tyr Gin Thr Tyr Trp Gin Glu Glu Asp 
15 10 

INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Xaa lie Arg Asp Phe Tyr Glu Glu Met 
1 5 

INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Ala lie lie Asn Val Thr Gly Leu Val Glu Ser Gly Tyr Asp Xaa . Xaa 
15 10 15 

Xaa Gly Tyr Xaa Val Ser Ser 
20 

INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
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J (ii) MOLECULE TYPE: peptide 

j (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

i 

| Asp Leu Pro Met Ser Pro Arg Gly Val lie Ala Ser Asn Leu Trp Phe 
15 10 15' 

2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
j (A) LENGTH: 14 amino acids 

f (B) TYPE: amino acid 

j (C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Asp Ser Glu Gly Asn Asp Gly Glu Leu Phe Xaa Ala His Thr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Tyr Tyr Phe Lys 
1 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asd Pro Gly Tyr He Val He Asp Val Asn Ala Gly Thr Pro Asp 
1 5 10 15 

) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

j (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Leu Gin Tyr Gin Thr Tyr Trp Gin Glu Glu Asp 

' 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Xaa He Arg Asp Phe Tyr Glu Glu Met 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(ix) FEATURE: 

(A) NAME /KEY: modified base; N=inosine 

(B) LOCATION: base pairs 3, 6 and 12 

(C) IDENTIFICATION METHOD: commercially available 

(D) OTHER INFORMATION 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
YTNGTNGARW SNGGNTAYGA 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(ix) FEATURE: 

(A) NAME/KEY: modified base; N=inosine 

(B) LOCATION: base pairs 6 and 12 

(C) IDENTIFICATION METHOD: commercially availabl 

(D) OTHER INFORMATION 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AACCANARRT TNGANGCDAT NAC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(ix) FEATURE: 

(A) NAME/KEY: modified base; N=inosine 

(B) LOCATION: base pairs 6 and 15 

(C) IDENTIFICATION METHOD: commercially availabl 

(D) OTHER INFORMATION 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GARGGNAAYG AYGGNGARCT NTT 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ' 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 

(ix) FEATURE: 

(A) NAME/KEY: modified base; N=inosine 

(B) LOCATION: base pairs 3 and 9 

(C) IDENTIFICATION METHOD: commercially available 

(D) OTHER INFORMATION 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AANAGYTCNC CRTCRTTNCC YTC 23 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ATTGGGGCTC CTTCAAGACC TT 22 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TGATGATTCC AAAGTTTC 18 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

TTGGAAGAAT ACGGTTGG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

TACTATTTCG TCTGCTTGGG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

GAACTCTTCC GTGGTCTCCT 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
CCACCTGCGT GTTGGGGTCT 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
CAGATCTACA AAACATGCGA G 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

TGTCGCAGAC TGTACTTG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single - 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

GAGTGTACAC GACATAAA 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
ATGGCTACTC TTCCCCAGAA AG 22 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1801 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 84. .1721 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

TGAATTCGTG GGTCGAAGAG CCCTTTGCCT CGTCTCTCTG GTACCGTGTA TGTCAAAGGT 60 

TCGCTTGCAC ACTGAACTTC ACG ATG GCT ACT CTT CCT CAG AAA GAC CCC 110 

Met Ala Thr Leu Pro Gin Lys Asp Pro 
1 5 

GGT TAT ATT GTA ATT GAT GTC AAC GCG GGC ACC GCG GAC AAG CCG GAC 158 
Gly Tyr He Val He Asp Val Asn Ala Gly Thr Ala Asp Lys Pro Asp 
10 15 20 25 

CCA CGT CTC CCC TCC ATG AAG CAG GGC TTC AAC CGC CGC TGG ATT GGA 206 
Pro Arg Leu Pro Ser Met Lys Gin Gly-Phe Asn Arg Arg Trp He Gly 

30 35 40 

ACT AAT ATC GAT TTC GTT TAT GTC GTG TAC ACT CCT CAA GGT GCT TGT 254 
Thr Asn He Asp Phe Val Tyr Val Val Tyr Thr Pro Gin Gly Ala Cys 
45 50 55 

ACT GCA CTT GAC CGT GCT ATG GAA AAG TGT TCT CCC GGT ACA GTC AGG 302 
Thr Ala Leu Asp Arg Ala Met Glu Lys Cys Ser Pro Gly Thr Val Arg 
60 65 70 



ATC GTC TCT GGC GGC CAT TGC TAC GAG GAC TTC GTA TTT GAC GAA TGC 
He Val Ser Gly Gly His Cys Tyr Glu Asp Phe Val Phe Asp Glu Cys 
75 80 85 



350 
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GTC AAG GCC 
Val Lys Ala 
90 

GAC GAT AGG 
Asp Asp Arg 



TTC AAG ACC 
Phe Lys Thr 



TGC TAC TCC 
Cys Tyr Ser 
140 

ATT TTG GCC 
lie Leu Ala 
155 

GAG GTC GTC 
Glu Val Val 
170 

GTG CAC AAA 
Val His Lys 



ACA GGT GGC 
Thr Gly Gly 



AAG GAT TTG 
Lys Asp Leu 
220 

TTC AGC TGG 
Phe Ser Trp 
235 

AAG TAC TTC 
Lys Tyr Phe 
250 

TTT CAA ATC 
Phe Gin He 



ATC ATC 
He He 



GGT TAC 
Glv Tyr 
110 

TTG TTC 
Leu Phe 
125 

GTC GGC 
Val Gly 



AAC GTC ACT 
Asn Val Thr 
95 

TTC GTC AGC 
Phe Val Ser 



AGA GAC CAC 
Arg Asp His 



CGC TTG 
Arg Leu 

GTT AAG 
Val Lys 



GAT TCC 
Asp Ser 
190 

GGT GGC 
Gly Gly 
205 

CCC ATG 
Pro Met 



GAC GGT 
Asp Gly 

AAA CTT 
Lys Leu 



TTC CAT 
Phe His 
270 



CTC GGT GGC 
Leu Gly Gly 
145 

CAT GGC CTC 
His Gly Leu 
160 

CCA GTC CTC 
Pro Val Leu 
175 

GAA GGC AAC 
Glu Gly Asn 



GGA AAC TTT 
Gly Asn Phe 



TCT CCA CGG 
Ser Pro Arg 
225 

TTC ACG AGA 
Phe Thr Arg 
240 

GCC AGA TGT 
Ala Arg Cys 
255 

CAG GCA GCG 
Gin Ala Ala 



GGT CTC 
Gly Leu 



AGT GGA 
Ser Gly 
115 

GGA AGA 
Gly Arg 
130 

CAC ATT 
His He 



GTT GAG AGT GGT 
Val Glu Ser Gly 
100 

GAT ACA AAT TGG 
Asp Thr Asn Trp 



CCC GTC 
Pro Val 



ACC GAA 
Thr Glu 



GAC GGG 
Asp Gly 
195 

GGA ATC 
Gly He 
210 

GGC GTC 
Gly -Val 



GAT GCC 
Asp Ala 



GAT TGG 
Asp Trp 



GTT CTT CCC GGG 
Val Leu Pro Gly 
135 

GTC GGC GGA GGT 
Val Gly Gly Gly 
150 

GAT TGG CTC AGC 
Asp Trp Leu Ser 
165 

GAC TCG GTA CTC 
Asp Ser Val Leu 
180 

GAG CTC TTT TGG 
Glu Leu Phe Trp 



TAT GAC 
Tyr Asp 
105 

GGC TCC 
Gly Ser 
120 

GGT TCC 
Gly Ser 



GAA GAG 
Glu Glu 
275 



ATC ACC AAA TAC 
He Thr Lys Tyr 
215 

ATC GCA TCA AAT 
He Ala Ser Asn 
230 

TTG CAG GAT TTG 
Leu Gin Asp Leu 
245 

AAG AAT ACG GTT 
Lys Asn Thr Val 
260 

TTT GTC ATG TAC 
Phe Val Met Tyr 



GAC GGC 
Asp Gly 



GGC GTG 
Gly Val 



AAG TAT 
Lys Tyr 
185 

GCA CAC 
Ala His 
200 

TAC TTC 
Tyr Phe 



TTA CAC 
Leu His 



TTG ACA 
Leu Thr 



GGC AAG 
Gly Lys 
265 

TTG TAT 
Leu Tyr 
280 



398 



446 



494 



542 



590 



638 



686 



734 



782 



830 



878 



926 



96 



AC A TCC TAC 
Thr Ser Tyr 



TAT CAT TTG 
Tyr His Leu 
300 

ACC AAA GCG 
Thr Lys Ala 
315 

CCG CGC AAG 
Pro Arg Lys 
330 

ATG GAC TAC 
Met Asp Tyr 



CCG AAT CAG 
Pro Asn Gin 



CCG GAT TTC 
Pro Asp Phe 
380 

GAC GGC TTG 
Asp Gly Leu 
395 

ATG TTT GGT 
Met Phe Gly 
410 

GCG CAG CGC 
Ala Gin Arg 



GAA GAA GAC 
Glu Glu Asp 



GAG GAG ATG 
Glu Glu Met 
460 



TCG AAC GAC GCC GAG CGC GAA GTT GCC 
Ser Asn Asp Ala Glu Arg Glu Val Ala 
285 290 

GAG GCT GAC ATA GAA CAG ATC TAC AAA 
Glu Ala Asp He Glu Gin He Tyr Lys 

305 

CTT GGC GGG CAT GCT GGG TGG GCG CCG 
Leu Gly Gly His Ala Gly Trp Ala Pro 
.320 325 

AGG CAC ACA TCC AAG ACG TCG TAT ATG 
Arg His .Thr Ser Lys Thr Ser Tyr Met 
335 340 

CCC TTC TAC GCG CTC ACT GAG ACG ATC 
Pro Phe Tyr Ala Leu Thr Glu Thr He 
350 355 

CGC GGC AAG TAC AAG TCT GCG TAC ATG 
Arg Gly Lys Tyr Lys Ser Ala Tyr Met 
365 370 

CAG ATC GAC GTG ATC TGG AAA TAC CTT 
Gin He Asp Val He Trp Lys Tyr Leu 

385 

ACT AGT GCC GAA ATG AAG GAT GCC TTA 
Thr Ser Ala Glu Met Lys Asp Ala Leu 
400 405 

GGT GAG ATT CAC AAG GTG GTC TGG GAT 
Gly Glu He His Lys Val -Val Trp Asp 
415 420 

GAG TAC ATC ATC AAA CTG CAG TAC CAG 
Glu Tyr He He Lys Leu Gin Tyr Gin 
430 435 

AAG GAT GCA GTG AAC CTC AAG TGG ATT 
Lys Asp Ala Val Asn Leu Lys Trp He 
445 450 

TAT GAG CCG TAT GGC GGG GTT CCA GAC 
Tyr Glu Pro Tyr Gly Gly Val Pro Asp 

465 



CAA GAC CGT CAC 97 4 
Gin Asp Arg His 
295 

ACA TGC GAG CCC 1022 

Thr Cys Glu Pro 

310 

TTC CCC GTG CGG 1070 
Phe Pro Val Arg 



CAT GAC GAG ACG 1118 
His Asp Glu Thr 
345 

AAC GGC TCC GGG 1166 
Asn Gly Ser Gly 
360 

ATC AAG GAT TTC 1214 
He Lys Asp Phe 
375 

ACG GAG GTC CCG 1262 

Thr Glu Val Pro 

390 

CTC CAG GTG GAC 1310 
Leu Gin Val Asp 



GCG ACG GCA GTC 1358 
Ala Thr Ala Val 
425 

ACA TAC TGG CAG 1406 
Thr Tyr Trp Gin 
440 

AGA GAC TTT TAC 1454 
Arg Asp Phe Tyr 
455 

CCC AAC ACG CAG 1502 

Pro Asn Thr Gin 

470 
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GTG GAG AGT GGT AAA GGT GTG TTT GAG GGA TGC TAC TTC AAC TAC CCG 1550 
Val Glu Ser Gly Lys Gly Val Phe Glu Gly Cys Tyr ?he Asn Tyr Pro 
475 480 485 

GAT GTG GAC TTG AAC AAC TGG AAG AAC GGC AAG TAT GGT GCC CTC GAA 1598 
Asp Val Asp Leu Asn Asn Trp Lys Asn Gly Lys Tyr Gly Ala Leu Glu 
490 495 500 505 

CTT TAC TTT. TTG GGT AAC CTG AAC CGC CTC ATC AAG GCC AAA TGG TTG 164 6 
Leu Tyr Phe Leu Gly Asn Leu Asn Arg Leu lie Lys Ala Lys Trp Leu 

510 515 520 

TGG GAT CCC AAC GAG ATC TTC ACA AAC AAA CAG AGC ATC CCT ACT AAA 1694 
TrD Asp Pro Asn Glu lie Phe Thr Asn Lys Gin Ser lie Pro Thr Lys 
525 530 535 

CCT CTT AAG GAG CCC AAG CAG ACG AAA TAGTAGGTCA CAATTAGTCA 1741 
Pro Leu Lys Glu Pro Lys Gin Thr Lys 
540 545 

TCGACTGAAG TGCAGCACTT GTCGGATACG GCGTGATGGT TGCTTTTTAT AAACTTGGTA 1801 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 546 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Ala Thr Leu Pro Gin Lys Asp Pro -Gly Tyr He Val He Asp Val 
15 10 15 

Asn Ala Gly Thr Ala Asp Lys Pro Asp Pro Arg Leu Pro Ser Met Lys 
20 25 30 

Gin Gly Phe Asn Arg Arg Trp He Gly Thr Asn He Asp Phe Val Tyr 
35 40 45 

Val Val Tyr Thr Pro Gin Gly Ala Cys Thr Ala Leu Asp Arg Ala Met 
50 55 60 

Glu Lys Cys Ser Pro Gly Thr Val Arg He Val Ser Gly Gly His Cys 
. 65 70 75 80 

Tyr Glu Asp Phe Val Phe Asp Glu Cys Val Lys Ala He He Asn Val 

85 90 95 
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Thr Gly Leu Val Glu Ser Gly Tyr Asp Asp Asp Arg Gly Tyr Phe Val 
100 105 110 

Ser Ser Gly Asp Thr Asn Trp Gly Ser Phe Lys Thr Leu Phe Arg Asp 
115 120 125 

His Gly Arg Val Leu Pro Gly Gly Ser Cys Tyr Ser Val Gly Leu Gly 
130 135 140 

Gly His He Val Gly Gly Gly Asp Gly He Leu Ala Arg Leu His Gly 
145 150 155 160 

Leu Pro Val Asp Trp Leu Ser Gly Val Glu Val Val Val Lys Pro Val 

165 170 175 

Leu Thr Glu Asp Ser Val Leu Lys Tyr Val His Lys Asp Ser Glu Gly 
180 185 190 

Asn Asp Gly Glu Leu Phe Trp Ala His Thr Gly Gly Gly Gly Gly Asn 

195 200 205 

Phe Gly He He Thr Lys Tyr Tyr Phe Lys Asp Leu Pro Met Ser Pro 
210 215 220 

Arg Gly Val He Ala Ser Asn Leu His Phe Ser Trp Asp Gly Phe Thr 
225 230 235 240 

Arg Asp Ala Leu Gin Asp Leu Leu Thr Lys Tyr Phe Lys Leu Ala Arg 

245 250 255 

Cys Asd Trp Lys Asn Thr Val Gly Lys Phe Gin He Phe His Gin Ala 
260 265 270 

Ala Glu Glu Phe Val Met Tyr Leu Tyr Thr Ser Tyr Ser Asn Asp Ala 
275 280 - 285 

Glu Arg Glu Val Ala Gin Asp Arg. His Tyr His Leu Glu Ala Asp He 
290 295 300 

Glu Gin He Tyr Lys Thr Cys Glu Pro Thr Lys Ala Leu Gly Gly His 
305 310 315 320 

Ala Gly Trp Ala Pro Phe Pro Val Arg Pro Arg Lys Arg His Thr Ser 

325 330 335 

Lys Thr Ser Tyr Met His Asp Glu Thr Met Asp Tyr Pro Phe Tyr Ala 
340 345 350 

Leu Thr Glu Thr He Asn Gly Ser Gly Pro Asn Gin Arg Gly Lys Tyr 
355 360 365 
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Lys 


Ser 
370 


Ala 


Tyr 


Met 


He 


Lys 
375 


Asp 


Phe 


Pro 


Asp 


Phs 
380 


Gin 


lie 


Asp 


Val 


lie 
385 


Trp 


Lys 


Tyr 


Leu 


Thr 
390 


Glu 


Val 


Pro 


Asp Gly 
395 


Leu 


Thr 


Ser 


Ala 


Glu 
400 


Mat 


Lys 


Asp 


Ala 


Leu 
405 


Leu 


Gin 


Val 


Asp 


Met 
410 


Phe 


Gly Gly 


Glu 


He 
415 


His 


Lys 


Val 


Val 


Trp 
420 


Asp 


Ala 


Thr 


Ala 


Val 
425 


Ala 


Gin 


Arg 


Glu 


Tyr 
430 


He 


He 


Lys 


Leu 


Gin 
435 


Tyr 


Gin 


Thr 


Tyr 


Trp 
440 


Gin 


Glu 


Glu 


Asp 


Lys 
445 


Asp 


Ala 


Val 


Asn 


Leu 
450 


Lys 


Trp 


He 


Arg 


Asp 
455 


Phe 


Tyr 


Glu 


Glu 


Met 
460 


Tyr 


Glu 


Pro 


Tyr 


Gly Gly 
465 

Phe Glu 


Val 
Gly 


Pro 
Cys 


Asp 

Tyr 
485 


Pro 
470 
Phe 


Asn 
Asn 


Thr 
Tyr 


Gin 
Pro 


Val 

Asp 
490 


Glu 
475 
Val 


Ser Gly 
Asp Leu 


Lys 
Asn 


Gly 

Asn 
495 


Val 
480 
Trp 



Lys Asn Gly Lys Tyr 
500 

Asn Arg Leu He Lys 
515 

Thr Asn Lys Gin Ser 
530 



Gly Ala Leu Glu Leu Tyr 
505 

Ala Lys Trp Leu Trp Asp 
520 

He Pro Thr Lys Pro Leu 
535 



Phe Leu Gly Asn Leu 
510 

Pro Asn Glu He Phe 
525 

Lys Glu Pro Lys Gin 
540 



Thr Lys 
545 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ACCAAGTTTA TAAAAAGCAA CCATCAC 27 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

ATGAATTCGT GGGTCGAAGA GCCC 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
CAGGAATTCA TATGGCTACT CTTCCCCAGA AAG 



